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ABSTRACT 


In order to determine the Fourier transform of a quasi-perlodic time 
series (linear problem), or the power spectrum of a stationary random 
time series (quadratic problem), it is desirable that data be recorded 
without interruption over a long time interval. In practice, this may 
not be possible. The effect of regular interruption such as the day / 
night cycle is well known. We here Investigate the effect of irregular 
interruption of data collection (the "breaking" of the window function) 
with the simplifying assumption that there is a uniform probability p 
that each interval of length T , of the total interval of length T * NT, 
yields no data. 

For the linear case we find that the noise-to-slgnal ratio will 

-1 -2 

have a (one-sigma) value less than e if N exceeds p (l-p)e 

For the quadratic case, the same requirement is met by the less restric- 

-1 -1 

tive requirement that N exceed p (l-p)E 

It appears that, if four observatories spaced around the earth were 
to operate for 25 days, each for six hours a day (N - 100), and if the 
probability of cloud cover at any site on any day is 20% (p » 0.8), 
the r.m.s. noise-to-slgnal ratio is 0.25% for frequencies displaced 
from a sharp strong signal by 15 yHz. The noise-to-slgnal ratio drops 
off rapidly if the frequency offset exceeds 15 uHz. 



EXAMINATION OF TIME SERIES THROUGH RANDOMLY BROKEN WINDOWS 


I. INTRODUCTION 

In many astrophyslcal problems one is concerned with the study of 
time series. It often happens that the property of particular Interest 
is the spectrum of the time series. In principle, one may determine a 
time series to a prescribed accuracy by making measurements, without 
interruption, over a sufficiently long time Interval. In practice, the 
length of time over which the variables may be measured will be limited. 
Moreover, measurements may necessarily be interrupted (or otherwise Impaired) 
for one reason or another. The relationship of the spectrum determined 
by limited, interrupted measurements to the intrinsic spectrum has been 
the subject of many investigations, as recently reviewed by Deeming (1975). 

If the original time series is denoted by x(t) , one may regard the 
measurements y(t) as being determined by 

y(t) - f(t)x(t), (1.1) 


where f(t) is the "window function." We regard x, y and f as being simple 
scalar functions but the procedure may be generalized to replace x, y by 
vectors and f by a tensor. 

We use the Fourier transform notation 



where the limits of integration are to be taken to be - 00 
limits are not explicitly specified. 


( 1 . 2 ) 

U.3) 

to + 00 if other 
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If we are interested in determining x(a>), the Fourier transform of 


the time series x(t), then we may use the relation 


y(<o) 


/• 


dto' ?(co') x(a»-a)') 


(1.4) 


to relate the Fourier transform of the measured time series y(t) to that 
of the original time series x(t). 

We are interested in the possibility that f(t) may be regarded as a 
random variable, expressible as 


f(t) « Fttjc^.cij, .. .,a N ) , 


(1.5) 


where . a N are independent random variables with specified expec- 

tation distributions. By the central limit theorem (Papoulis, 1965), we 
expect that the random variable f (or its Fourier transform f) will have 
a distribution approximately Gaussian in form if N is not a small number, 
so that an adequate representation of f would be given by its mean < f > 
and its standard deviation o(f). 

If, on the other hand, x(t) is a random time series, we will be 
concerned with the autocorrelation function R x (t), defined by 

R x (t) - <x(t') x ( t ' +t)> , (1.6) 


and the power spectrum of the time series, defined as the Fourier transform 

of R (t) : 
x 


R x (t) 


S x (o>) 



(1.7) 


( 1 . 8 ) 
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On noting that 


<x(3) x (oj , )> - S x («) 6(<u + <a') (1.9) 

and evaluating <y(co) y (a»’)> , we may verify that 

S (w) - 

y 

where 

w(o>) » f(co) f (-to). (1.11) 

Clearly the function «(a>) represents the capability of the measurement 
process, described by the "window function" f(t). to determine the power 
spectrum S^Coj) . The function w(u)) may be expressed in terms of the inde- 
pendent random variables 

w(w) - W(&>;a 1 ,a 2 ,...,a N ). (1.12) 

Once again, unless N is a small number, we expect that the distribution 
of w will be approximately Gaussian so that it may be characterized by 
its mean value <w> and standard deviation o(w). 

This article was prompted by a problem related to the determination 
of normal modes of oscillation of the sun, as determined by measurement of 
the photospherlc velocity field. Measurements have been presented by 
Deubner (1975) and by Rhodes et al. (1977), and their theoretical inter- 
pretation discussed by Ulrich and Rhodes (1977) and by Ulrich et al. (1978). 
For optimum determination of the power spectrum of the velocity field 
(expressed as a function of wave number) , it is clearly desirable to make 
observations without interpretation over as long an interval as possible. 
Away from polar regions, observations from a single station are interrupted 


f 


d<u' wfo>') 


S x (co-w') 


( 1 . 10 ) 
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by the day-night cycle which leads to unacceptable aliasing of the data. 
Observations made from a spacecraft In polar orbit would obviously yield 
un-aliased data of higher quality and higher frequency resolution. Obser- 
vations made from the south pole during austral midsuoner can lead to 
several days of uninterrupted observation and to still longer intervals 
with occasional, irregular interruption. It is also possible to select 
three or four stations around the earth which, in the absence of any cloud 
cover, could give continual coverage of the sun for many weeks. However, 
one must anticipate that some of the data would be lost by cloud cover. 

It is clearly desirable that one should be able to make some esti- 
mate of the accuracy with which oscillation modes may be determined when 
it appears possible to observe the sun over a long interval of time losing 
some blocks of time because of cloud cover. The purpose of this article 
is to develop a model which enables us to address problems of this type. 
After presenting a few general formulas, we shall simplify the problem 
considerably by supposing that observations are made over a large number 
l of equal time intervals, each of length T , so that the total time 
interval T is given by 


T - Nr . (1.13) 

With certain additional simplifying assumptions, we shall consider the 
statistical properties of the functions f(w) and w(co) which are repre- 
sentative of "randomly broken" window functions. 
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II. MATHEMATICAL MODEL 


In the case that the window function f(t) is expressible in the form 
(1.5), in terms of a number of random variables, we wish to study the distri- 
bution of the functions f(w), w(co) , entering equations (1.4) and (1.10). 

We suppose that the distribution of the variables to is given by 
the probability function 9(0^, ..., o^) such that P(o^, ..., a^da^ ..., da^ 
is the probability of finding in the range to a^ + da^, etc. Then the 
expectation value of the quantity f(<*>) is given by 

<f(w)> ■ ^d N a P N (a) F(u;a 1 ,.. .,0^), (2.1) 

N 

where d a denotes da^..., da N , and P N (a) denotes P(ap .... a N ) . If we 
use the following notation for the variance of a complex variable of a 
complex variable z, 

o 2 (z) - o 2 (z r ) + cr 2 (z i ), (2.2) 

where z and z^ are the real and Imaginary parts of z, then noting that 
f(-a>) is the complex conjugate of f(a'), we see that 


a (f(o>)) - <f(co) f (-«)> -<f(oj)> <?(-«)> . (2.3) 


The first term on the right-hand side may be evaluated from 


<w(oj)> = <f (oj) f (- oi)> 


■ P H 


(a)F(<o;a 


1 * 


,a N ) 


F (-oj; a. 


,C V 


(2.4) 


We see that equation (2.4) also gives the expectation value of the 
"window spectrum" w(&) which appears in equation (1.10) and is appropriate 
for the discussion of stationary random time series. The variance of this 
function is given by 


2 

0 (w) » <v(oj) W (- 0))> - <w(03» <w(-0))> 


(2.5) 
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where 


<w(w) w(- <u)> 





( 2 . 6 ) 


As Indicated in the introduction, we intend to consider the case that 
the observing time tg to t^, of length T, is divided into N equal Intervals 
bounded by times t^, ... where 


t - t_ + m 
n 0 


(2.7) 


so that we may adopt the form 

F(t;aj, . . . , ctjj) " a n | h ^ C ” C n - 1^ ~ C n^| (2.8) 

n-1 ' 

where h(t) is the Reavyside function: 


(2.9) 


h(t) - 0 if t<0, | 

- 1 if t >0. ) 

We also assume that the intervals are statistically independent, so that 
we may write 


P^(&)d ot = P(oi^ | • • • , , « * * i dflljj m k‘“ 1^ ^ a i | * " * | ' 

If we assume that, for each Interval, there is a (uniform) probability p 
that the window is open and probability 1-p that it is closed, then 


P(On) - p 6(ct n -l) + (l-p)<5 (a n ) . 
In evaluating (f(<u)), given by (2.1), we will use 

<ot n > " p * 


( 2 . 11 ) 


( 2 . 12 ) 


In evaluating the quantity given by equation (2.4), we will need to 

evaluate (a at \ which is clearly given by p if m ^ n but by p if m * n. 
\ tn n< 
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Bence 


<<*a >- p 2 + ^p-p 2 > 6 «»» (2 * 13) 

on on 

where is Che Kronecker function. In evaluating the quantity given 

by (2.6), we need to evaluate the expectation value of a a a a . By con* 

o n p q 

slderi; > the various possibilities (a,n,p,q all different: two of them 

the same, etc.) we find that 


< a a a a > 

map q 


p 4 + (p 3 -pS(6 i + 5 + 

on op 


oq 


+ 6 +6 
np 


nq 


6 ) 

P«J 


+ (p 2 - 3p 3 + 2p 4 ) (6 + « + 6 + 5 ) 

npq apq anq onp 

♦ (p - 4p 2 + 6p 3 - 3p 4 ) 6^, (2.14) 


where 6 

onp 


1 if m • n ■ p otherwise 0, and 6 

mnpq 


is defined slailarly. 
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III. EVALUATION OF MODEL 

For simple (non-random) tima series, aquation (1.4) givas tha relation- 
ship batvaan tha Fouriar transforms of tha original and measured time series. 
In this context, the properties of the random window function f(t) may be 
characterized by ^f(w)^ and 0 2 (f). 

On substituting the form (2.8) Into (1.5), we find that 


'(“S*! Vw 2 “n + . 

n » 1 ' ' 

On using (2.1) and (2.12>, we obtain 

, i T la, ( t n + t u ) 

<f(u)> - 2 ^- T p sinc(-| o> T) e , 


(3.1) 


(3.2) 


where sine 0 • 0 * sin 0 . 

On using (2.4), we find that 


- £> 2 [ 


<w(u)> 

Hence, using (2.3), we obtain 


p 2 sinc 2 (yuT) + N *p(l- p)sinc 2 ( 


7“ T> ] • 


(3.3) 


0 2 (f) - 2 T 2 N" 1 r - p) sine 2 cot) . 


(3.4) 


For evaluating the effects of "breaking* 1 of the window function, it 
is convenient to normalize the standard deviation with respect to the maxi- 
mum value of f(oo), which is the value at u ■ 0. Accordingly, we introduce 
the definition 


. o (£(«)) 

t i < “ ) <7(5)> ■ 


(3.5) 


For the case under consideration, this has the form 


- , . - 1/2 - 1/2 „ . 1/2 | . 1 ,, 
2-^(0)) » N p (1 - p) |sinc j cut | . 


(3.6) 
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For discussion of the properties of readonly broken windows in the 
study of stationary random tine series, it is necessary to evaluate the 
naan value and standard deviation of w(<o) . The former la given by equa- 
tion (3.3). The first tern inside the brackets has the sane fore as arises 
in the non-random case (p ■ 1). The second term represents a change in the 
mean spectrum, so it Is convenient to Introduce the symbol Ap for the 
ratio of the additional term to the maximum value of the principal term: 

dp • N * p” 3 (1-p) sine 2 . (3.7) 

On writing equation (2.6) in the simpler form 

<w(m)w(- <u)> ■ <F(u>;a)F(a);a)F(- oj; a)F(- a>; a)> (3.8) 


and using equations (1.3) and (2.8), we see that 

1 _ / iut iwt A / iut 

<w(w)w(-<u)>-/— r 2 <Q, m a n a p a q > r "" / \* 

\2iudI mnpq 


lwt ,\ 


(3.9) 


On using equation (2.14), we see that this may be expressed in the form 


<w(oj)w(-«)> - — |p\ + (p J - p“) (E 2 + Z* + 4E 3 ) 


3 4, 


(2iru) 


+ 2(p 2 - 3p 3 + 2p 4 )(E 4 + eJ) 


(3.10) 


+ (p - 4p 2 + 6p 3 - 3pS E 5 J 


where 
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■ (■&) jo 4 "‘“‘(j" 1 ) 

+ 2(p 3 -p 4 )n- 2 .lDC 2 (i<uT).inc 2 (i OT )|^ 
+ A (p^— p^) N ^ 8inc^|Y a) ^'j 8 ^ nc ^|y w T j 
+ 4 ( p 2 — 3p 3 + 2p 4 )N' 2 sinc 2 |iojT| sinc 2 ||a)Tj 


+ 


(p - *P 2 + 6P 3 - 3 P 4 )n' 3 



On using equations (2.3) and the definition 


r 2 (co) 


0 (w(co)) 

(¥^F 


we find that 


2 2 (co) * |2 N 1 p 1 (l-p) sine 2 ^ 8inc 2 |^Tj 

, „„-2 -1,, x . 2/1 . 2/1 \ sincaT 

+ 2 N p (1-p) sine \j oTjsinc 

+ N 2 p 2 (3 - lOp + 7p 2 ) sinc 2 |^- sinc 2 |-j cdt| 

) 1/2 

+ N 3 p 3 (1 - 4p + 6p 2 - 3p 3 ) sinc 4 |-j 0 )t|| . 


(3.13) 


(3.14) 


(3.15) 
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IV. DISCUSSION 


We see from the preceding sectior that the effect of a random "break- 
ing" of the window function is to produce an aliasing of any signal. This 
effect is described by the function Z^ or by Aj and Z^ as given by equa- 
tions (3.6), (3.7) and (3.15). 

For the "linear" problem, as described by equation (1.4) the mean 
Fourier transform of the window function, as given by equation (3.2)» 
has the same form as it does in the non-random case, although it is 
reduced by a factor p. The standard deviation is characterized by Z^, 
defined by equation (3.5) and given by equation (3.6). 

It is convenient to introduce the notation 

o> T - 2T* 1 , <a T - 2t _1 . (4.1) 

On noting that sine 0 < 1 for 0 ^ 1, and that |s-<nc 9| < 9 _1 for 0^1, 
we see that where 

S 1 (0J) - N _1/2 p" 1/2 (l-p) 1/2 , 0)^C0 t ) 

1(4.2) 

S 1 (o>) - N' 1/2 p' 1/2 (l-p) 1/2 (G) T /u)) , 0) ^co t . I 

Hence the aliasing is most severe within the range of a few times of 
the strongest signal. 

We may infer from the above restriction the minimum number of 
intervals necessary to ensure that is below an assigned level e for a 
given value of p. 

We see from (4.2) that we require N > Nowhere 



If, for instance, p ■ 0.8 and we require that E^ < 0.05, N must be at 
least 100. 

For the "quadratic" problem in which we are determining the spectrum 
of a stationary random time series, the aliasing is described by the 
functions and given by equations (3.7) and (3.15). 

We see from equation (3.3) that the second term in brackets is smaller 
than the envelope of the first term, and so may be neglected, for oi < , 

where 




- » u hi-e) in p- V \ 


(4.*) 


For (o > the second term produces a "tail" to the principal contribu- 
tion to<w(co)> . By an argument similar to that leading to equation (4,2), 
we find that < D^, where 


D 2 (o>) » N* 1 p" 1 (l-p) , co< U) T , 


D 2 (co) - N* 1 p" 1 (l-p)(a) T /co) 2 t a) > o> t . 


(4.5) 


Now consider the four terms in the second bracket in equation (3.15). 

It is clear that the third term may be ignored by comparison with the first 

since it has a similar dependence on co but includes an extra factor 

The second term may also be ignored by comparison with the first: the 

extra factor N ^ (sin cuT)/(sin cot) has a maximum value of unity* and an 

- 1/2 

RMS value of order of N 

In comparing the fourth term by comparison to the first, we see that 
the ratio is given by 

sinc^(iaiT\ 

R - Q(P) 4 (4 - 6) 

N sine (-^oiT) 
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where 



(4.7) 


It is easily verified that Q(p) <0.5 in the range 0.5 < p < 1. hence 
R has a ma x imu m value of order 0.5 at values of (o for which «T ■ 2nir. We 
find that, when R is averaged over frequency, it varies with N as N - ^. 

Hence we may, to sufficient approximation ignore the fourth term and so 
replace (3.15) by 


2 2 (w) * 2 1 ^ 2 N“ 1/,2 p‘’ 1 ^ 2 (l - p) 1 ^ 2 sinc(jwT)||sinc(j<i)r)J. (4.8) 


We find that < S 2 , where 


- /x . ,l/2 M -l/2 -1/2,. .1/2 

S 2 ((o) - 2 N p (1-p) Uj , 


M«> - 2 1/ V 1/2 p’ 1/2 (l-p) 1/2 -I , 0) T< <o < 0) . > (4.9) 


,l/2 H -3/2 -1/2,. .1/2/ M 2 

2 N P (1 “ P) \— J . w > . 


S 2 (o>) 


We see that, for the same values of N and p, the maximum value of Z^ is 
1/2 

2 times larger than the maximum value of Z ^ . Hence the minimum number 
of intervals necessary to ensure that Z ^ is below an assigned level, 
for various values of p, is twice the corresponding value of N^, given by 
equation (4.3). 

However, the quadratic case is more complicated than the linear case 
in that E^Co)) * s more complicated than E^(co), and A^(c^) is nonzero (whereas 
A^(o)) is zero and has been neglected). 
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On noting that the dominant term of (3.3) (that which survives in the 

2 

nonrandom case that p ■ 1) varies as (w^co) for a> > <o T , we see that Z 2 (<o) 
Is less than the tall of the dominant term, ^nd so may be neglected, for 
co < o>j , where 


a-,r v \ . 


<4.X0) 


On the other hand, we find that S-(a') is less than D (< u) for co > co , where 

2 2 C 


« c - 2 l 'V'V' ! d- f )- I, S » 2(Dj. . 


(4.11) 


Hence we may, to adequate approximation, ignore E 2 (co) in assessing the 
aliasing which occurs in the quadratic case. 

We see from (4.5) that the minimum value of N necessary to ensure 
that A 2 (<x>) i® less than some maximum value e is given by N > Nj, where 

N 2 - p" 1 (l-p)e“ 1 . (4.12) 


We see from (4.3) that is smaller than by the factor £. Hence 
aliasing is likely to be less serious in the quadratic case than :*.t is in 
the linear case. 

In order to assess the implications of the present model concerning 
ground-based observations of solar oscillations, one will need to have 
detailed estimates of the expected spectrum (in particular, the spacing 
and relative power of nearby lines) and the expected cloud cover at three 
or four observatories positioned round the world* It is also desirable 
that the present model should be extended by considering separate values 
of p for each of the observatories, and possibly by taking into account 
the correlation between cloud cover on consecutive days. 
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Nevertheless, we can Illustrate the results of this model by consider- 
ing a hypothetical situation. Suppose that four observatories are located 
around the world in such a way as to give continuous coverage (in the 
absence of cloud cover), and that these observatories are operated for 25 
days. Then N ■ 100. Suppose that, for any observatory on any day, there 
is 20X probability of cloud cover so that p - 0.80. We find from (4.1) 
that (using v - o)/2ir), v^. - 15uHz. For frequencies less than this value, 
(4.5) shows that the aliasing amounts to 0.25% or less. For frequencies 
above 15y Hz, the aliasing drops off rapidly. 

Although it will be necessary to make more detailed and specific 
calculations to draw definite conclusions, it appears from the above simple 
example that it may be possible to carry out high-quality studies of solar 
oscillations from a chain of ground-based observatories. 

This work was supported in part by NASA Grant NGL 05-020-272, Office 
of Naval Research Contract N00014-75-C-0673, and the Max C. Fleischmann 
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